Automated Paraphrase Generation with Over-Generation and Pruning Services
نویسندگان
چکیده
Conversational services are emerging as a new paradigm for accessing information by simply uttering questions in natural language, posing whole set of challenges to the design and engineering systems. Training conversational deal with nuances language often requires collecting high-quality diverse training samples (i.e., paraphrases). Traditional approaches such hiring an expert or crowdsourcing involve data collection processes that costly time-consuming. Automated paraphrase generation is promising cost-effective scalable approach generating samples. Current automatic techniques, however, tend specialise specific types lexical syntactic variations. As result, generated paraphrases may not perform well relevant quality aspects diversity semantic relatedness. In this paper, we follow inspired integration address these issues generate English semantically diverse. We propose extensible reusable pipeline combines paraphrasing techniques two-step process first focus on i) leveraging strengths multiple most (and possibly noisy) paraphrases, then ii) common separate step. Through empirical evaluations show benefits combining more balancing relevance diversity.
منابع مشابه
Paraphrase and Textual Entailment Generation
One particular information can be conveyed by many different sentences. This variety concerns the choice of vocabulary and style as well as the level of detail (from laconism or succinctness to total verbosity). Although verbosity in written texts is considered bad style, generated verbosity can help natural language processing (NLP) systems to fill in the implicit knowledge. The paper presents...
متن کاملParaphrase Generation with Deep Reinforcement Learning
Automatic generation of paraphrases for a given sentence is an important yet challenging task in natural language processing (NLP), and plays a key role in a number of applications such as question answering, information retrieval and dialogue. In this paper we present a deep reinforcement learning approach to paraphrase generation. Specifically, we propose a new model for the task, which consi...
متن کاملNeural Clinical Paraphrase Generation with Attention
Paraphrase generation is important in various applications such as search, summarization, and question answering due to its ability to generate textual alternatives while keeping the overall meaning intact. Clinical paraphrase generation is especially vital in building patient-centric clinical decision support (CDS) applications where users are able to understand complex clinical jargons via ea...
متن کاملApplication-driven Statistical Paraphrase Generation
Paraphrase generation (PG) is important in plenty of NLP applications. However, the research of PG is far from enough. In this paper, we propose a novel method for statistical paraphrase generation (SPG), which can (1) achieve various applications based on a uniform statistical model, and (2) naturally combine multiple resources to enhance the PG performance. In our experiments, we use the prop...
متن کاملParaphrase and Textual Entailment Recognition and Generation
Paraphrasing methods recognize, generate, or extract phrases, sentences, or longer natural language expressions that convey almost the same information. Textual entailment methods, on the other hand, recognize, generate, or extract pairs of natural language expressions, such that a human who reads (and trusts) the first element of a pair would most likely infer that the other element is also tr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2021
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-030-91431-8_25